Gestalt: Unifying fault localization for networked systems
نویسندگان
چکیده
Researchers have proposed many algorithms for localizing faults in networked systems, but it is unclear which algorithm is best suited for a given network; the performance of these algorithms differs markedly for different networks. We develop a framework that can explain these differences by anatomizing the algorithms into their basic choices and analyzing these choices with respect to six defining characteristics of real networks. Our analysis also reveals that no existing algorithm simultaneously provides good localization accuracy and low computational overhead. Based on our insights, we develop a new algorithm called Gestalt. To perform well across a range of networks, Gestalt combines the good choices of existing algorithms and with a new method to explore the space of possible faults in a way that is both low overhead and robust to noise. We apply it to three real, diverse networks: an email network, a peer-topeer messaging system, and an ISP network. In each case, Gestalt has either significantly higher localization accuracy or an order of magnitude faster running time. For example, when applied to Lync [2], Gestalt localizes faults with the same accuracy as Sherlock [6], while reducing fault localization time from days to 23s on a single system.
منابع مشابه
Gestalt: Fast, Unified Fault Localization for Networked Systems
We show that the performance of existing fault localization algorithms differs markedly for different networks; and no algorithm simultaneously provides high localization accuracy and low computational overhead. We develop a framework to explain these behaviors by anatomizing the algorithms with respect to six important characteristics of real networks, such as uncertain dependencies, noise, an...
متن کاملDesigninga Neuro-Sliding Mode Controller for Networked Control Systems with Packet Dropout
This paper addresses control design in networked control system by considering stochastic packet dropouts in the forward path of the control loop. The packet dropouts are modelled by mutually independent stochastic variables satisfying Bernoulli binary distribution. A sliding mode controller is utilized to overcome the adverse influences of stochastic packet dropouts in networked control system...
متن کاملModelling and Compensation of uncertain time-delays in networked control systems with plant uncertainty using an Improved RMPC Method
Control systems with digital communication between sensors, controllers and actuators are called as Networked Control Systems (NCSs). In general, NCSs encounter with some problems such as packet dropouts and network induced delays. When plant uncertainty is added to the aforementioned problems, the design of the robust controller that is able to guarantee the stability, becomes more complex. In...
متن کاملThe Study for Guaranteed Cost Fault Tolerant Control of the Networked Control Systems
In this paper, the problem of guaranteed cost fault-tolerant control for networked control systems (NCSs) is discussed based on Lyapunov stability theory and Linear Matrix Inequality (LMI). The sufficient conditions possessing robust integrity against actuator failures are given by adopting memory state feedback control law, which can meet a cost function for closed-loop networked control syste...
متن کاملStabilization of Networked Control Systems with Variable Delays and Saturating Inputs
In this paper, improved conditions for the synthesis of static state-feedback controller are derived to stabilize networked control systems (NCSs) subject to actuator saturation. Both of the data packet latency and dropout which deteriorate the performance of the closed-loop system are considered in the NCS model via variable delays. Two different techniques are employed to incorporate actuator...
متن کامل